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BLAST 
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Taxonomy 



Struct^ 



S&UST 1 SEQUENCES BjmTS VERSION BUSTN JNav-W-2000] 



Sequence 1 gi 4£3X106 Sequence 3 from Patent WO9740167. Length 8209 (1 3209) 
S+qttt&c* 3 gi 36303 1 1 Gibbon leukemia vim*, complete gtaora* Xxagtb *088 (1 $0$&) 

2_ 



-The statistics (hitsaor* and eacpaci value) U calculated bated on th* sia* of nr dat&bwo 

If protein translauoa is reversed, please repeat the jsearch wich fevtrse strand of ihe query seqtwsnce 

Score = 1583 bit* <212SD) , Expect - 0.0 
Identities - $776/50*3 (64%), Gaps = 1625/&943 (18%) 
Strand ** Pitta / Pius 



Query; 
503 CC: 



12 



tggtgtacgactgtg-ggccccag — cgegct-tg — gaataaaaatcctcttgctgctt 

i i i i niii i mm i i in ii iimn mmuii 1 1 

tagag-a~gactgagccgcccgggtacccg , -tgtgtccaataaaa--cctctt.gctgatT 



Query: 



5$ 



67 



gcatcaaga-ccgct — tctcgtgagtgat-^ttg gggrgtcgect — ct-tccga- 

MlM It III I MUM III III Mil II 111 III II 
goATtfOggagccg-tggttitcigt tgttect-tgggAgggtttctcctaaccac-'Egac 



Query: 

sbjctr 



104 -gcccggac-gagggggattgt-tcttttactggcctttcatttggtgcgttgg-ccggg 

lltl M Mill I i Mil 111 II II II II Ml 
122 cgccc— actxeggggg— tctcacxtttgggggc tc crteca — ggatcacra 



Quajry; 
Sbjct: 



1 60 aa atcctgcgaccacc — ccttacacc — c — gag— aaccgacttgg — ag-gta 

It MM 1IMIM II MM I Ml II I I 111 II M 
169 aaccccacccagggaccaecgacc — -caccaacgggaggtaa — g-c-tggecageg-a 



Ouery: 
Sbjct: 



205 nag — g-g-atc-ccctt-tg-g — aa — cminhnnmumnnc-ggcc — g-gegtc — 

I M M I lit II I III II II I Mil 

220 ccgttgtgtgtctcgcttctgtgcctaAgtc-cgcaattctgaccgtccttgtgtgtctc 



Query; 
Sbjct! 



247 — -tctgt-tctga-— gt--gtctg-->t— —tttcggtga-tgcgcgc tttegg 

1 M 11 1 M M 1 1 MM I II II I Mllll! Ml I I 
279 gQttctgtgtctgagacscgtaactetgaetgeectc — gtaagtgcgcgcarttttttgg 



Query; 
sbjct:: 



287 tttgeag-etgtcctctcagaccgt-aagga— crg-gag-gae-rgsga-t — c-agca 
III Ml MM Ml 1 III) I III Ml I M Mill I Mil 
337 ttt-cagtctgt— tc-egg — -gtgaatcaetctgcgagtgacgtgtgagtagcgaaca 



Query: 
Sbjct: 



33S gacgxgctaggaggatcaeeggct-gccaccctgggggacgccccgggaggt-~- gggga- 

MUM I II II 111! MM Ml Ml MM III Ml I MMI 
390 gacgtgttcgg-ggctcaccgcctggtaatccagggdgacgtcccagga— tcaggggag 
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Query; 
sbjct: 



392 gagccaggg&egcctggtggtctcctactgtcggtcagaggaccgagttctgttgttgcva 

If II M II III! MIMil t it I If til I I II I It (III III 
447 ga-ccagggacgcctggtggac-cc — c — tcggt-a-a cg-ggtc-gttg-tgac 



Query: 
Sbjct: 



452 gegaaagcttc-cccctccgcg — g-c-cg-tccgactct-tttgcctgct-tgtggaa 

III I II III II I 1 I I II 1 1 II III II MUM) 
4$2 ccgatttcatcgcccgtctg-gtaagacgcgctctgaatctgatt — ct-ctctctcg- - 



Query: 
Sbjct! 



403 gacgcgg*cgggteg-cg-tgtgtctgg-atctgttggtttc-tgtctcgtg — tgtctt 

I lit I III II II Mill Ml II HIM II It II I It 
54 6 gtcgc — c — tcgccgccgtctctggtttctttttgtttc-gtttc-tggaaagcctc 



Query: 557 

tfJRAL CORE PROTEIN I 

Sbjct: $98 

gag polyp rote in 1 



tgtcttgt~gc»gtccttgtctac — agt-tttaatatgggac-agacagtgactaccoc 

H <3 Q T V T T P 
II J|| I III II III I J I I I I 1 1 I I I I I I I 1 1 I I Ulllll 
eg — tgtcacagtctttctctcccaaatcatcaatatgggaeaagataat-tctaccco 

M G Q P N S T P 



Query: 611 

VIRAL CORE PROTEIN $ 

Sbjct: 654 

gag polyprotein 9 



ccttagt-t- — tgactctcgaccattggactgaagtragatccagggetcataatttgt 

LS LTLDHWTEVRSRAKNL 

ii i i i mil I ii mi M n m i niiiiiJ in i j 

ta-tctccctcactctaaatcactggagagatgtgagaacaagggctcacaatctat 

XSLTItNHWRDVRTRAHNL 



Query: 667 

\TtW> CORE PROTEIN 27 

Sbjct: 710 

gag polyprotein 27 



cagttcaggttaagaagtfgaccttggcagactttetgtgcctctga^tggccaacattcg 
5VQVKKGPWQT FCASEWp? F 

iii i ii i mm miiimmiri mi n mil iimii 

ccgtggaaatcaa^aagggaaaatggcagactttctgttc<^ccgagtggcccacattcg 
SVEJKKGKWQTFCSSEWPTP 



Ou**V: 727 

VIRAL CORE PROTEIN 47 

fcbjcrts 770 

gag polyprotein *47 



atattogatgg£:c J atcagaggggAcctTit*A^---tctgaaax^atc6cer9otgttaagg 
DVGWPSEGTPN SEIXLAVK 

ii 1 1 -i mn mi ii i ii .nun mi mi i n inn 

GVGtiPPEGTFNLSVl F A V K 



Query: 

VIRAL CORE PROTEIN 
Sbjct i 

gag polyprotein 



784 caatcatt — tttca-gactggaccGggctctcatectgatcagga$ — ccctatatcc 
66 X t I fQTGPGSHPDQE P * I 

ii m mn n ii in nm n n n n mm 

825 -aaagattgtctttcagga — ga-acgggggacatccggacca— agttccatatatcg 

66 K I V P Q E HCGHPDQ V P * I 



Query; Q38 

VXRRL CORE PROTEIN 84 

Sbjct: 878 

gag polyprotein 83 



t— >tacgtgqca&gatttgg — cagaagatcctccgccaLgggttaaaccatggctaaat 
L TWQDL AEDPPPWVKpWLN 

i n urn n i i mi mi ii iiiniii m i n 

tggta-^tggcaggacetcgeceaga — atcccccaccatgggt— *geca-gcet 

V V WQDLAQ N P P P W V PA 



Query; 

VIRAL CORE PROTEIN 

Sbjct: 

ga$ polyprotein 



6 94 aaaccaagaaagec-aggtccccgaatcctggetcttggaga — gaaaaacaaacact~ 

103 KPRKP CPRXtALCSt K W K H 

11 [II Kill 11 I II Mil II I Ml II 11 I 

926 cc gccaaggtcgctg-- t— tgtctct — gatacccg-aagac- — cagtt 

95 3 A K V A V V S P T R R P V 



Query: 943 

VIRAL CORE PROTEIN 121 

Sbjct s 367 

gag polyprotein 113 



-cggccga-aaaagtcgagc-cct cttcctcgtatctaceccgag-atc-gaggagc 

S A E K V E P SSSYLPR D RCA 

in it i n tii in i ii i mn mi i i i 1 1 1 1 i 

gcgg-ggaggcea-tc-agctcctccccgaccccccatcta-cccg-gcaacagacga-c 
A G R p S APPRPPIV P ATDD 



Query: 

VIRAL COPJ PROTEIN 

Sbjct; 

gag polyprotein 



1001 -^-cgcegact-tggccggaaccccaacctgttcccccaccccctt&tecagcacagggt 
133 A D LAGTPTCSPTPLSSTG 

iii iii i mm u i mn in nm n n 

1021 ttactcctcctct ctgaaccc^-ac— g—cccccgcec tatcegge — — ggc 

131LLLL SEP T PPP Y P A A 
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Query: 

VIRAL CORE PRGTElJl 
SbjCTIS 

gag polypr&teln 



1057 gctgtgaggggacctct-geecctc— c-—t~gga~-ge--tccgg tg g-t 

157 CCSGT5AP P GAP V V 

|(| 1 I 1 1 1 f Itltlli j I III I! I! II II I I 

10S5 actgcca*«ccccctccggcccctcaggcgatccraaccgccgtcacroccaa3tacccgat 
14$ LP PPLAPQAXGpPSGQMPD 



Quary; 

VXRA& CORE PROTEIN 
Sbjct: 

gag polyprotein 



1097 -ggaggga-cct— gctgcegg — gactcggagccggagaggcgcc-aceceggag — eg 
171 EG P A X G rRSRRSATPE R 

1 II If III | III I I II IN ( I I 111 1 1 1 J III! II f 
1123 agtagcgatcctgag^ggccagccg-ctggga-ccag-gagtcgccgcgoccgcagtcca 
155 SSDPEGPA AG T R S R ft A R S P 



Query: 

VIRAL CORE PROfEXN 
3b jot i 

gag poXyprotein 



114$ gacagactfaga-tegca--at-atta66octffcacacci:at^f<rfieetit^— »*eeA^-^ — 
IS? TOE I A 1 L-PL*RT*GP P M 

I llfll I I III I 1 I I It II! I 1 I II Mill IMI I 

184 AD HSGPDSTV ILPLRAIG 



Query; 

VIRAL CORE PROTEIN 
Sbjct: 



1197 — ecagg— «g-ggeeaattge — ageccctc--cagtattggccettttcttctgcagat 
204 ? G GOL OPL QYWPFSSAD 

II II II illll H I MIM II Hllllli UNI fl Mill I 
1234 cccccggccgagcccaatggcctggtccctctaeaatattggcctttttcctcegcagat 

203 »P^EE>MCLvptQVWPP3SXP 



Quary; 

VIRAL CORE PROTEIU 
Sbjct: 

gag polyprotein 



1216 ctctataa^ggaaaactaaccatcccccttcctcggaggatcecoa — aegecteaegg 
221 LYN^KTNHPPrSEDPQ R L S 

II f I 1 I 1 1 1 1 1 f 1 1 III) MM1I Mil M M I fill 1 t MUM I 
1294 ctttataattggaaatctaatcatccctctttttcrga—aaacccagcaggtcrcacgg 

ZZZ LYHWK5NIf?3FSS N P X G L X 



Query; 

VIRAL CORE PROTEIK 

Sbjct: 

gag polyprotein 



1306 ^gttggtggagtcccttatgttctctcaccagoctacttgg^atgattgtcaacagctgc 

340 CSLVESLKFfiKQPCWDDCQQX* 

If t I IMM 1 1 M 1 11 M M 1 1 ( 1 1 1 1 I It I M M MMI MIMIM I 
13S2 ggctccttgagtctcttatgttctcccatcagcccacttgggae^artgccaacagctcc 

24lGti&E2LMFSHQPTWDDCO0Xi 



Query: 

VIRAL CORE PttOTEIM 
sbjett 

gag polyprotein 



13$6 tgcagacactcttcacaacegaggagcgaga^agaattctgttagaggctagaaaaaatg 
2S0LQTLFT?EERERIttEARKN 

| lilt II Mill M MIM M it IHMUI l Itllf t MUM ( 
1412 racagattcttttcaccactgag^aacgggaaagaattetccrggaggcccgcaaaaaxg 
2fflLQXLFTXEERERlLLEARKiJ 



Query; 

VIRAL COKE PROTEIN 
Sbjct: 

gag polyprotein 



1426 ttcc-tggggccgacgggcgaccc-acgcag-ttgcaaaa — rgagattgacatg-g^a 

260 VP G A D G R P TQ L Q N ElDM G 

IM IMM I I 111 I Ml 1 1 Ml I 1 I M 1 II! i Ml II 
1472 -tccttggggacaatggg-gcccctacacagctcg-agaacet — catt-a-atgaggce 
281 V £ G D H G APTQL E N 1 X HEX 



Query: 

VIRAL CORE PRQXEXK 
fib jet r 

gag polyproteut 



1479 tttcecttgactcg-cc-c-cggttgggactacaacacggctgaaggragggagag-ct" 
298 fPXiXR P GWpYHTAEGRE 3 

11 Ml I 1 Ml M 1 I MIM lltlllil 1 1 1 J 1 1 1 1 1 1 1 1 1 1 II 
152$ ttcccectcaatcgacctcac — tgggattacaacaeagccgcaggtagggagcgtctt 
2$9 F P L R P H WDYWTAXGRERL 



Query: 

VIBAL CORE PROTEIN 
gag polyprotein 



1534 -tgaaaatctatcgceagg-ctctggtggcgggtcte— cggggc-gcctcaa^cggcc 
316 L K I Y R Q XLVAGt R G X S R R J? 
II IIM MM II MM Mill MIMI IMM I Hi (Mil 

Ififii A«g— gtctaee^cc-^^aat^tagty^tt^gtctoaea^^^oa^-otc — ggegcee 
31fl L V Y R RTLVAGLKGA A R R * 



Query: 

VlRAl CORE PKOTETtf 
Sbjct: 

gag polyprotein 



1589 cactaatttggctaaggtaagagaggtgatgcagggacc-gaaegaaectccc-tcggt^ 

335 THLAJCVREVMQGP STEPP S V 
It MMMMMMf IMM Mllilllll I I MIM 111 MIM 

1635 taccaarttggctaaggtaagagaggtcttgcagggaccggca-gancc-ciscttoggt- 

336 TKLXKVREVLQGPX PSV 



Query; l€47 t^totr5«3a-*9g^t«atggaag«t-^*Agg*ggtt;c;4«=*5C£ictcrgot-ccXaccrCc 

nttp/AvvwiKbLitfm^ 21/02/01 



HRR 23 2031 B5:43 
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VIRAL CORE PROTEIN 
Sbjct: 

gag poiypratein 



354 PLC 
1 1 1 1 1 I 111 



RIME 

i u tint 



uii i in i i in ii nun ill i n 



1692 tttett-agaacgcctgatggaa^Ut-a^agatacactccgtttgatccct-ct 
35$ r t ERLMEXYRRYTPrO?S 



VXRAI* CORE PROTEIN 
Sbjct: 

gag polyprotein 



VlfcXI* CORE PROSRXN 

Sbjct: 

gag polyprotein 



VIRA& CORE PROTEIN 
Sbjct: 

gag polyprotein 



Ouery: 

VIRAIi CORE PSOtEIS 
Sbjct: 

gag polyprot^tfi 



Query: 

VIRAL CORE PROTEIN 

Sbjct t 

gag polypretain 



Query: 

VIRAL CORE PROTEIN 
sbj4t = 

gag polyprotein 



Gutty; 

VIRja CORE PROTEIN 
Sbjct: 

gag polyprotsin 



mi i? mi ii ? ii in iimn " n 11 1 "I'i'I 

1749 tgagggaca-^fi^gctgcggtcgccatggcctttacoggacagtcagcccc^gatatca 
374 E <3 Q QXAVXMXftCQSAFPJ- 

1762 ggaagaaactt-cagagactggaagggfctacaggaggctga — gttacgtgatctagtg 



392 



R K 

III 



K 

in 



a r 

niii 



III 



o e 
Mill 



L R 
Id! 



D 

III 



L V 

III! 



1808 aga*-*aaagttacagaggctagaggggctccagga~ ct-attcctt*caagatttagta 

3*3 K ft KiQRI*B6^QD Y 6f X* si P *» v 



1818 agagaggnnnnnnnnnnnnnnnn^ 
U1 M 1 1 U 

:a< 

412 X E A E 



4U 

1864 UigUW**^ 



1878 nmuronnwiiura^^ 

«X £K£REEREE %t mil M IIM II II IIIlT 

1324 

1837 taaga-tcttggccgcagtggttgaag--g-gaagag-c*^ 

451 K I L A A V V E G KS 5 R E R K u 
lit I || I 1 1 I I | 1 1 If M III IIU1I II III! 

Id83 taaaattc-tggccgcagtagt— aagtagagaagggtcca-caggtaggeag^cag— 

452 K I I. * A V V 3REC5TSRQT 

1391 ttttaggaaaattaggtcaggccci^gacagtcag^ga-acc^gggcaat--a|gacc 

A64 yftKIRSGPROS 0 N I* GN R * 

tin i ii u ii n n 1 1 n mi in n i im 

2036 iiaacctgaa--ca-acc--aa C ca--aagaagacar=tiaggg--atg|aagacc 

4S9 G N L S N Q A KKTPR PGR* 

2046 -ccactcgacaaggaccagtgtgt^tattgtaaagaaaaaggacactgggcaag^aact 
AZ-t PLDKDQCAYCKEKGHWAK « 

tii it mil iiitim n n itiiini n n 11 ninni m i 

2082 tccactagaeaaagaccagtgcgcatactgtaaagagaagggccattgggcaagagaa^t 
434 pLpKDQCAYCKBKGHWARt. 



Qu*ryt 

VIRAL CORE PROTEIN 
Sbjct: 

gag polyprotein 



Query: 

VIRAL CORE PROTEIN 
REVERSE TRANSCRIPTASE 

Sbjot: 

pol palyprotein 
gag polyprotein 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct; 

pol polyprotein 



3104 
505 

2141 
504 



i im ii n n in it "I ii inn Hill i» urn 

gtccccgaa-aa aa*cacgtc4gag*agcc-a«ggttetztgeect«- agata- 

C P R K 



KHVREA KVLAL 



2156 agattaggggagacggggttcggaccccctccccgagcccagggtaactttgaaggtgga 



£24 

1 



0 



G 



R R 



GSDPLPEPRVTLKVE 

iiiimmiinmm n iimn i m mi i 



i 

£20 



G3QGSDPLP 



19 l» y f v . .T. ....... . .* iiiiiiiI. II 1 III 



2247 «iitUiU-iwtt«tm» 



nmiimm n inmn n nmim 11 1 m 



ccaa 



hnp://www>nd>Lito.t^ 
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2027856687 



T-049 P. 37/53 Job-494 
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Quary; 

R8VRRSE TRANSCRIPTASE 38 



Sbjct; 

pol polyprotoin. 



2274 -ccattaggaaaacta aaagimnwmntcctgggtgatgggt:gccacagg--gea 

? £ (3 F L KEfCK3»VMGA*G <3 

IIJ I Ulllt II II N II II II II UNI IN 

230$ ccca-tgggaaaagtagggtceagacggacggtcgtg ga-aggagegacaggcagea 

38 PWGKVGSRRTVV EGATGS 



REVERSE TRANSCRIPTASE 55 
Sbjct; 

pol polyprotetn 



2327 acgg-cagtatecacggactacccgaagaacegtt-- gacttggga-gtgggaegggta* 
R QYPWTTRRTV OLG VGR V 

MM 11 11 HIM II 111 It If I II HI I II I] t 
2360 a-rfgtc— tacccctggaccaeaaaaag-acttttaaaaatt-ggacat-aaacaagtga 
56 JC V Y P tt T 1 K R LLKt GH K Q V 



Qu»ry; 

REVERSE! TRANSCRIPTASE 
Sbjct;: 

pol polypf^tein 



2383 ccciictcgtttctflgteat-ecctgagtgcccag-tacccct-'tctaflgtagagactti^c: 
74 THSFX.VJ P E C P VPL £ G R D I* 
KIlMi |] Mllllll 111 1 1 1 1 f 1 1 1 M Mill I II II III I I 

74 TH$FI»VXP E C P A PL !• G R D I« 



Query: 

REVERSE TRANSCRIPTASE 93 
SbjCt j 



2440 tgaccaaga-t-ggga^ewaatttc— tttt—gaac^aaggaagaccagaagtgtc- 

L TK M G AQIS T E Q G R P £ V S 

I HIM II I II II I I I IMI I I Ml I HI I M i 
2471 taaccaa-atstaaag-gcccaga-tccagttttccg— ctgagg—gcccacaggtaaca 

93LTKLKAQIQPS A? GPQVT 



Query- i4 92 TgrgaataacaAacecateactgtgttgacc — ctec-aa — ttagatgaT^naratcga 

REVERSE TRANSCRIPTASE 1X1 VNNKPXTV1T X. Q J. X> D S J * 

1| | | III I I I M I M M I I I I M II I H II mil HI 
sbict . 2524 tggggagaac—gccc- t-actatg-tg-cctggttotaaacctggaagAagaaraccga 

* 0;,C - w 5 E RP TM C £VliNl*EBEYR 



pol polyprotain 



111 



Query: 

REVERSE TRANSCRIPTASE *20 



sbjct; 

pol polyprorein 



2547 ctdtat-tetccccaagtaaagcctgatcaagatat— a—cagtcctggttggagcagt 

in ii ii iii i ill it in i ii mm I in J 

2578 ctacargaaaagoc-agt— accctcctc tatcgaccca-tcctggct«eagc-t* 

129 If H £ K ? V P 3 3 TDF3»X.QI* 



Query 

REVERSE TRANSCRIPTASE 147 F P Q 

mini 

Sbjct; 

pol polyptoteln 



2602 ttc6cca-agtsctgggcagaaaccgcagggatgggtttggcaaagcaagttcccccaoa9 
A W A ETAGHGLAKOVPPQ 

i iiiiiiiiii ii n mil I H ii mi nti 1 III 
2fi29 ttccccactgt*tgggcagaaagagccggcatggg*ctagccaatcaag-tece*c-cag 
146 FPt V»A8RAGMGLANQ VP P 



Quary: 

REVERSE TRANSCRIPTASE 1S7 



SbjCt : 

pol polyprotein 



2€61 *gttattca-a-ctgaaggcca-gtg-ctacaccagtatca^-teagacagtaccccttg 



V I 0 t K A S ~ A T P V 5 V R Q Y P I* 

Mi M Ml 111 U Ml II IMIIM I I U UN II II U 
2667 tggtaet-agagct-aagatcaggtgcct-caccagtggctgrce-gacaatatccaatg 

lfiS V V V EI* RSGA SPVAV RQYPH 



Query; 



2715 agtagagaggctcgagaaggaatttggccgcatgttca-aagattaatcc-a-afi — agg 



REVERSE TRANSCRIPTASE 165 3 R E A R E G I W P H "V Q R l< X Q 0 

ii i in inn inn M inn i ii mi ill i n ii i 

2743 agcaaagaagctcgggaaggtatcagaccccacatccagaag-t — tcctagaccxagg 
184 3KEAREGXRPHIQK FLDLG 



Sbjct: 

pol polyprotain 



Qu»ryi 

REVERSE TRANSCRIPTASE 
pol polyp*rgtdin 



2770 gcatcctagt-tcctgtccaatc-cccttggaatactccc-ctgctaccggttaggaagc 
203 GIL V PVQS PttUTP LLPVItK 

i hi ii inn i n in i i 1 1 1 m i m imiiii n i mi 

273d 9 -^trttggtgC«Otgt-cgg^og«5c-tggAattto-ccotatgot^Cctgtaaa^A«ac 
202 VLVPC RSP WKT » x T «, ™ v «- 



p L X. P V K K 



Query j 

REVERSE 



TRANSCRIPTASE 



2827 
222 



ctgggaccaatgattatcgaccagtacaggacttgagagaggtoaataaaagggtgcag^ 
PGTKD^RPV QDLREVN K. a VQ 



Sbjct: 

pol polyprotcin 



i iimimii MiH nm inn mini i minimi mi 

2855 cagggaccaatgactatcggccagttcaagacctgagagaaattaataaaagggtacagg 
221PGTNDYRPVQDLREIN. fCRVQ 



hUpy/www.ncbi.ntoLn&^^ 21W2/01 



MPR 23 2001 05:45 



0141 221 7739 PAGE. IS 



MAR-23-01 10:52 From: FOLEY, HOAG & ELIOT LLP 
Blast RcsuH 



2027856687 



T-049 P. 36/53 Job- 



Knmfe«;E TRANSCRIPTASE 242 DIH PTVPNBYN LfcSALPPEF 
REVERSE TRANSCRIPTASE D ^1 H (| J ( | |||n mmj) (J , (( ||(| m| |||m , , 

3b x ct . 2916 atattcatccc-acagtcccaaacccttacaatcttc-t^a^ttcccttccgcct-agc:- 

pol polyprotein 241 D I H P TVPNPYNL t S S L F P S 



REVERSE 



TRANSCRIPTASE 262 N WYtVtO fcKPAFFCLRLH 

I || 11)111 III! II It M II I i I I I I I I IIIIIIII II Mil I I 

3^j CC t 2971 tatacttg^tactcagtcttagatc-tcaaggatgcctttttctgcctca^gcracatcc 

pol polyprotein 260 YTWVSVLD LKDAFFCLRLH P 



Query: 

REVERSE 



TPANSCKXPTAfiC 



pol polyprotetn 



3001 ccactagccaaccactttttgccttcigaatggagagatccag-gtacgggaagaaccggg 

390 PflSQPI-FArEWJtDP G ? G R T C 

t k nin ii it it n inn fin in nn i n n u n 

3030 caac-agccagccgctgttcgcgttcgagtggaaagacecagaaaaaggtaa-cacaggt 
2B0 M aOffcrAPeWXDPEKGN T © 



Query: 
REVERSE 



TRANSCRIPTASE 



sbjct: 

pol polyprotein 



3060 cagetcacetggacccgactgccccaagggttcaagaactc-cccgac-catetttgacg 
300 gfcTWTRiVQGFKNS Pt X F D 

inn mum m u n hum m mill ti ui n i mi mi 
3098 cagotgacctggacgcggctaecacaagggttcaagaactctccc-actc-tcttegacg 



Outty: 3118 aagccctacacagggacctggc-caacttcaggatc—caaeaecctcaggtgaccctcc 

REVERSE TRANSCRIPTASE 31*EALHRDI*A WFRI Q H P Q V T L 

I 1 1 1 1 1 m m i mil m lit i nil m Mint ii 

sbjct* 314S aggccctccaccgagatttggctc-cctttagggccctcaac-ccc-caggtggtgttac 

pol polypjroteift 31BEALHRDLA P F R A L N P QVVL 

Quary: 31*7 5 tccagtacgtggatgacctgcttctggcgggagccacc&aaera-g«gactgcttagaag 

REVERSE TRANSCRIPTASE 338 IiQYVDDLLLAGATKQ PCIE 

tin 1 1 1 1 1 1 1 1 1 1 1 1 n in ii in in i i i mm i m 

smce: 3203 tecaatatgtggacgacct-ct — tggtggccgeceecaeatatgaagactgcaaaaaag 

pol polyprotein 337 X, Q Y V D to L LVAAPTYBDCKK 



Query: 3232 gtac— gaaggcactaQtgctggaatt— gtctgacctaggctacagagcctctgctaa 

REVERSE TRANSCRIPTASE 351 G T KALJ.X.E1, 5 D I* G * R A S A X 

1 u in ii ii i I in ll i i i i 11 Ml i i n inn 

Sfcjo*, 32fi0 g aacae»gaa-gctt^--*aeaggagttaag--taagtr3gggta<;cgggtatQgsct*a 

pol polyprotain 356 G ? G K X- I. G E L 3 



JCLQVRVSAK 



OUftrv- 328? gaaggcccagatttg-caggagagaggtaacata-cttggggtaoagr-ttgc-ggggcg 

REVERSE TRANSCRIPTASE 376 K A 0 I C R R £ V T Jf L 6 Y S L K G 

Hilllllll I II 11 lllllt II 11 fl f llllllll I II II I 
sbjct* 3315 gaaggcccagcrctgc^a-gagagaagtcacctatc-tggggtac-ctact-caaggaag 

pol polyprotein 375 K A Q L C Q REVTt L G Y • * " * 



It L K E 



Oucrys 3343 9gQft ^ ? a^30t^a-^«5*«ef^o<fflAa^aaaaetg-tagtcc*gat»«c3gecc<saaj 

REVERSE TRANSCRIPTASE 394 G G R tf t T E A R K K * V V Q I P A P * 

I | mini II I It II HI llll II I I 11 II I 11 II 
Sbjct: 3371 gaaaaagatggctaacccca-gcccgaaaggctactgtta-tgaaaatccctgttcctac 

pol polyproteiji 393 G K R W L T P X R K A T ~ " ~ « » » » 



M K 1 P V P T 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 



3401 cacagccaaacaagtgagagagtttttggggacagctggattttgcagactgtggatccc 
414 T AKQVREFIGTAGFCRLWIP 

U III lit II I II III I II II 11 lllll Mil! 1 1 MIIMII 
3429 gacccccagacaggt^cgtgaarrtctaggcactgccggattctgcaggctctggatccc 
413 2frR Q VflBFX<GTACrc;*LWIP 



Quary: 

REVERSE TRANSCRIPTASE 434 



SbjCC: 

pol polyproteiu 



3461 ggggtttgcgaccttagc^gc-cccactctacccgcwaccaaagaaaaagggggattct 



GFATLAA PL¥PI*TK1*KGGF 

miimii ii i ii ii in i inn inn mn mim 

3489 tgggtttgctcccctggctgc&ccc-ttgtaccccctaacaaaaga gagca-tc- 

433 GF AS1AAPLYPLTKE SI 



Qu6ry , 3$20 cc tgggctcetgag caccagaaggcatttgatgc-ta^aaaAggccctg 

REVERSE TRANSCRIPTASE 453 3 WAPE MQKAFDA IKK A J- 



MAR 23 2001 05:45 



6141 221 7739 PAGE. 19 



MAR-23-01 10:52 From: FOLEY, HOAG & ELIOT LLP 



2027856687 



T-049 P. 35/53 Job 
l*ft£e7of 16 



Sbjct: 3*41 
pel polyprotein 450 



M | || Kill [[ HI I Ml HIM I M Mill Ml M 

cettttatttgg— aetgaggaacatcagcaggcttttga-ccacataaaaaaagccttg 

PFIW fEE'JiQQAFO HIKKAL 



Query: 
Sbjct; 

pol polyprotein 



3570 cta*-affC^cacctgc-tctggccctccctgacgtaactaaaccctttaccctttatgtg 
470 L flAPA LAi**X>V*KPFfftYV 

Ml Ml I Mill I W II 1 1 1 1 1 1 III I M 1 1 M * I H 'I HI I 
3598 ctgtcage^-cctgcat-tggccctcccagacctcaccaagccattcactctatatata 
463 ISA F A LALPOLTXPFTLYX 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct; 

pol polyprotein 



3627 gatgagegtaag^ggagtagcccgaggagttttaacccaaaccctaggaccatggagga 
4fl& 0 E H K GVARGvtTQTLGPWR 
llllll I |l 1 1 M IMM MMI J M M M MIMM I I I I I 
3655 gatgagag—agocggcgtggcccggggagtgctcactcagactttaggaccctggcggc 
488 D t & AGVARGVLTQ-*"""" 



T L 0 P W R 



Query; 

REVERSE TRAHSCRT PTA5E 
pol polyprotein 



3685 gacetgttgcctacctgtcaaagaagcttgatctitgtagceagtggttggcc— cgtatg 
508 RP V AYLSKKLDPVASGWP V C 
| M I! 1 1 It M Mill II M Mill M Mill II IMM I M 

3713 c<7ccagtagc4tatct:>tcaaaaaaactgaatccggtggccagc:0tf(^tf<f^ aac — ct « 
507 RP V AYLSKKIiDPVASGWPT C 



Query: 

REVERSE TRANSCRIPTASE 
pol polyprotein 



3743 tctgaaggctatcgeagergcggc-catactggtcaaggacgctgacaaattgactttgg 
52S Jj JCAXAAVA ILVKDADKLTL 

111 I i 1 1 I MMI M II I 1 M MM MIMMI II M I) Ml I 
3771 ct^gaaagoggttgcagcagtagcac-tcct.tctcaaagacgctgataagttaa<5<5ttgg 

527 I» K A V A A V A LlrLKUAOK 



Ii T 



Query 9 

REVERSE TRANSCRIPTASE 
pol polyprotein 



a ftOa « acAff a*tata*«tgtaat:agccccccAt-gcjittggagaa-catfig«ccggcagcccce 
547 GQNXTVIAPH A L E H I V R Q P * 
(HI Ml I Mill II If IMM II I M M (Mill Mill IMM 
3830 gacaaaargtgactgtgattgcttcccatagc-ctcga-aagcwtQgtgcggcAACcccc 
5*6GQNVTVXASK$ LE SIVRQPP 



ouerv- 3660 agaccgatgcfatgaeoaacggccgcatgacccactatcaaagcct:gct-.tctcac3-g4Cf 

MM womu. «. . » «,»,«,» » » "J « ,| ,? ,•„»,* , » ,*, ,« 

Sbjct ; 3888 cgaccggtg^atga^caatgccagaatgaetcattiaccagagcctgctgt-t-aaatgaa 



Query: 



3318 agggtca-c-tc^-cgctccaccagccg-^qtcaaccctgccactcxtctgcctgaaga 

toon »»»»» «« fil y y 1( f ,* ,f ,» * , t ,t ,Y. tT n n n .i .? . E .f 

3946 agergt-arcgtctgcgc-cc-cctgct^to-ctaaacccsgctaccct^ettccagtcga 
585 R V S F A P PAV I,NPA:CLJ-PVE 



pol polyprotein 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 

pol polyprotein 



3374 gactgatg— a-acc^gtgactc^tgattgccatca-actat— t-gattgaggagactg 

60S T B $ PVTHDCHQ L LIBET 

MMI I MUM I M I Ml Ml I II I I DM MM 
4002 «tcggaagccaccccagtg*cacagg"tgc**tcaga-aatcctcg-ccgaagaaactg 
604 5 t ATP VHR C SB I L ABET 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 

pol polyprotein 



4027 g-ggtccg-caaggacctta-caga-cataccgctgactggagaagtgctaacctggt-t. 
622 G VR K P & TP ipilGSVLT* ? 

I Mill MM M lit 1 1 IM Ml M I MM llllllll 1 
4055 gaact-<:gacga-gacc-taga*^acca-accattgccogg-g— gtgccaacctggrat 



Query! 

REVERSE; TRANSCRIPTASE 640 
flhjct: 

pol polyprotein 



4082 eaccgacggaagcagcratgtggt — ggaaggtaaga-ggatg-gctggggcggc-agt: 
T D GSSYVV EGK R M A G A A V 

It Mill MIMMI I IIMIMM I 111 I II DIM I I I 
4108 -acagacffataacaa-t-t-tcatcacggaaggtaa-acoaa-^fagcaggflaccccoa-t 
639 T P GSS riTECK R RA6AP I 



413C ggtggacgggacccgcacgatctgggccagcagcctgccggaaggaacttcagcgcaaaa 
655 VD<5TR?IWA5SIiPEG*SAQK 
|| || || 1 M Ml I MMI IMMMMM MMI 1 1 Mill M I I 
hup:/A>ww.nctiri^ 2V02/O1 



Query: 

REVERSE TRANSCRIPTASE 



MAR 23 2001 05:46 



0141 221 7739 RRGE,20 



MAR-23-01 10:51 From:FOLEY, HOAG & ELIOT LLP 

Blast K£9Utt 



2027856687 



T-049 P. 34/52 Jcb-494 
Fogc 8 of 16 



Sbjct: 

pol polyprocein 



4161 cgtagatggcaagcggacggtatggga 
$$7 VOGKRTVWA 



ctagcagcctgccagaaggtacgtcagcccagaa 
SSLPEGTSAQ K 



Query: 

REVERSE TRANSCRIPTASE 
shjct: 

pol polyprotein 



4136 ggctgagctca- tggcectcacgcaagctttgcggctg gccgaagggaaatccataaaca 

$79 ABL MALTOALRXiAEGKSXN 

luiii n i i in i mil u n u 1 1 1 1 r 1 1 1 1 > i tit in mi 

4221 qgctgaact-agtagccttgacgc-aggcatt^cgcctggccgaAggaaaaaacatcaaea 
671 A E L VA&TQALRLAE5KNIH 



REVERSE TRANSCRIPTASE 
pol polyprotein 



4255 
698 



tttatacggacagcaggtargcctttgegaccgcaeaegta-- cacggggccatctataa 
IYTDSRYAFATAHV HGAIVK 
| f! It ill II M IIII1M I filll MIJl I! If 1 1 Itll! ti UN! 

43$ o r$ra<>aQ3g*cagcaggtA*gettt:tgccactgctca-^t*t*catggggcaa*»ea*aa 
69€irTD3RVAFATAK I K G A X Y X 



Query: 
REVERSE 



TRANSCRIPTASE 



Sbjet: 

pol polyprotein 



4313 acaaao«gacrttgcttacctcagcagggagg<faaar-aaagaa«^iaagag^ftaaT:tctaa 

"318 QRGIJ.TSAGRCI KNKEEXL 
|f MUM llll II fl It (I I It II III MMIMMMIM! I 
433S gcagag^gggctgctcacttctgctgg^aaagatatcaaa-aacaaa^aggaaairt-ttg 
?1€ QRGtLTSAGKDXK H K E E I I* 



Query • 



4373 g-ootattagaagcottaoatttgOOaAA^aggptagOtattatac^Atgtootggafia^ 
REVERSE TRANSCRIPTASE 737 5 LLEALHLPtRItAlIHCPGH 
REVERSE TWOTCKJFWIM. m| ||( , ,,, , ,j , „ , (| MMllMMI II 

4395 gccctgctagmggccatccatctccctaggcgggtcgccarcatccactgccctggccac 
735 ALLEAIHLPRRVAIIHCPCH 



Sbjfit: 

pel polyprotein 



Query: 

REVERSE TRANSCRIPTASE 7*7 



Shjcti 

pol polyprotein 



4431 ca^aaa^ccaaag^atctcatat— ctagagggaaccagatg^er^accg-ggttGrccaa 



0" K "A K DL I SfcGMQWADtt VJLK 
|tl II I 1 I I I I I I 1 I lllllll I I I ■ I II 11 II tM II 
44S6 eag — aggggaagtaae-cctgtggccactgggaaccggagggeega-cgaggctgeaaa 
755 0 HOSH PVATGHARAO E A A K 



Query: 

REVERSE TRANSCRIPTASE 77 € 
SbjCt: 

pol polyprotein 



4487 gcaggoagcccaggctgttaacc— ■-- • ttctgcctataatagaaaegceqaaagcc-cca 
Q A AQAVN I,J,PItet?KA P 

HI II III llll III I III I 0 in mi II I i 

4512 geaagecgee— > ^ctgtcgaceagagtgctggc aggaae-tacaaaacetcaa 

774 0 A A LSTRVLA GTTKPQ 



Query; 



REVCR55 TRANSCRIPTASE 794 



Sblct: 

pol polyprot«ir> 



4542 gaacccaga-cgacagtacaccctagaagactggcaagagataaaaaagatagaccagtt 



MUM II II HI I II Hill II "Hill 

4561 g-agcca-atcg--ag tcc—g ct—caaga — aa- — agaccag — 

790 E P X E P A Q E K T R 



Query; 



REVERSE TRANSCRIPTASE 814 



Sbjct: 

pol polyprot«ia 



4601 ctctgagactccggaggggacctgotacacctc^tatgggaaggaaatcctgccccacaa 



SETPEGTCYTSYGKEILPHK 
I IM1 (Ml ill HI 1 I H II f til 

4593 — -g cc-ga-ggga— *get-cacc-cct — — — cegg— gga~aa 

800 PREtTP D RGK 



Query: 

REVERSE TRANSCRIPTASE 634 



SbjCt; 

pol polyprotein 



4$$1 agaagggttagaacatgtccaa-caganacatc-gtctaacccacctaggaacta-aaca 



E G L S Y V Q Q I H R L T H I* G T KH 

llll II 111 II I i WWW It llll III llll II I HI 

4623 agaa — tt—cat-t aageggttacatcagt-taactcacttagg-accagaaaa 

811 £ F I FRLHQt^HLGPEK 



Query: 

REVERSE TRANSCRIPTASE 952 
Sbjct: 

pol polyprotein 



471S -c— ctgcagc-ggttggtcag-aacatcc-CCttatcatgttctgaggctaccaggagt: 
L Q Q1VR TS PYHVLRLPGV 
I I ] 11 ! It II Mill II HI HI I n II 111 I I 
4671 gcttcteeaacrag-tgaaccgtaccagcctcctcate — — ccgaacct-cca— a-T 
82fi LI.Ql.VNRtSr.iX PNLQ 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 



http://www.nchi.nlm.nik^^ 



4772 ggctg-a-ctcg-gtg^teaaacatt— gtgt — gc-cctgocagctggttaatgc— -t 
8T1 ADSVVKH CV PCQLVNA 

in i in i iii i ii i fin i i mil ii mm i 

4721 — cxgeagttcgcgaag^c-accagtcagtgtcaggcttgtgcw-' tgnctaaxgcggt 

21/02/01 



NAR 23 2001 05=47 



0141 221 7739 PAGE. 21 



MAR-23-0! 10:51 From: FOLEY, HDAG I ELIOT LLP 2027856687 T-049 P.33/53 --Jcb-484 



pal polyptoteia 



843 S A V R E V TSQCQACA M T N A 



Ouarv* 4876 actgggaagt^dac^ccactgaggtagagcc-^taaacacggaaacaaatatctatcg 

REVERSE TRANSCRIPTASE 905HWEVDPTEVKP AKreMKYL t 
REVISE TRANSCRIPTASE n ^ (| J||||m „ |||||n |( , f I ■ I I I I f Mill II 

sh A Ct . 4829 actgggagg tagacttcacagaaataaagcctgg-tcggtatggaaacaagwtctgtt a 

doI polyprotein 879 rWEVD?TEIKPG RYCN KVLL 

oua-v. 492fi gcttttotagacaccttt-tcacaataggtagaggcttatcctactaa ga*a-gag 

REVERSE TRANSCRIPTASE 92$ V * V D I F SGWVEAYPTK K 6 

REVISE TRANSCRIPTASE (j ( |(|| (j n|nM(||| ,,,, |Mlli (Ml I I 

fi M Ct - 4888 gcattcAtagata-ctttctccggatgggtagaagcttttcctaccaaaactgaaacg-g 

Jol Jiiyprotein B99 V FID T F 3 G « V E A F P T K T 8 T 

Ouerv- 4939 aettca4ecgtggtggctaagaaaatactgg^gga4atttttccaagatttggaaracct 

RTOERSE TRANSCRIPTASE 943 T 3 T V V A K K I I. $ ? .* 

Sbjeti 

pol polyptotcin 918 A Ii 



| I || Hi II Ml Mill [. I 1 1 1 1! II M tl I II II II III 
q uSf*fc« 4946 pcct-aatcgt-ctg— taaaaaaar^rtagaagaaattctaccccgcrrcgggatccct 

S3 »W A I, I V C KKXLECILPRFCIP 



Q VI SQ49 aaggtaatagggtcagacaatggtccag-ctttcgttgcccaggtaagtcagggactggc 

Rtf/SRSE TRANSCRIBE M3 K V I G 3 0 N G P A F V A Q V S Q 0 L A 

mini i luiiiiiiinii ii i mi iini J i r i ii i II 1 1 1 mi 1 1 r 1 1 

oMet* 5002 aaggtacrcgggtcagacaatggcccggccttt-g^tgctcaggtaagtcagg^acrggc 

pol poly*r<>tein 93? K V L G S D N G P A F VAQVSQGL* 



QUftrv . fiioa ca-agatattggggattgattggaaae^goattgtgcatacagaccccaaagctcaggac 

REVERSE **AHSC*XPTASB 993 K I I, C I D W K 1 fc C A V R P O S S G 

it i t 1 1 1 1 1 1 1 mini I llllllll I I I f • I I I i ■ UIIIIM I 
S Mcti 5061 cactca-actggggataaattggaagttacatcgtgcgtatagaccceagagctcaggtc 

t>Ol polvprotein 957 TQLGINtfKLHCAYRPQSSG 



Q Uftry * 51S7 aggtagagaggatgaatagaaccattaaagngaccctt-accaaatt-gaccacagagac 

REVERSE TRANSCRIPTASE 1002 Q VRRMWRTXKETL 

M M 1 1I I I II I M 1 1 1 1 I I 1 1 M M II I I I II I 1 1 M I M 1 1 1 I I M I 1 
Sb*ct\ 5120 aggtagaaagaatgaacagaacaattaaagaga-ccttgaccaaattag-ccttagagac 

pol polyprotein 9-7$QV&RMNRTXKE T J, T K I ALE? 

Ouery; 5225 — tggcattaatgattggatggccctcctgcccttrgrgctttttagggtgaggaacac 

REVERSE TRANSCRIPTASE 1022 O I H D W M A L L P F V L F R V R N * 

mi i 1 1 ii mi I) I inn nm i m mill tun n 

Sbjct: $178 cggtgg^a— aaagdctgggtga^cotccttcccttagcgctgcTitagggccaagaatac 

pol poiyp*OT*in 9Sfi § 5 JC0wVTLJ*PI#AIiIiRARKT 

Query: 5282 ecctggacagtttgggctgaccccctatg-aattgctctacggggga — -ruwinnming 

REVERSE TRANSCRIPTASE 1041 PGQFGLTPY EtLYGG PPP 

1 1 1 1 ii i mm i ii ii mi mi mn 11 m 

5fcj Ct2 $235 ccctggccggtttggtttaactccttatgaaact-ctctatggaggaccaccccccatac 

pol polyp*6tein 1015 PGRFGLTPYEX LYGGPPPI 

Ouery: 5337 ttg-g~c— agaaattgcctttcfcacatagtgctgatg-t-gc;tgcjt-tcccagcctt 

REVERSE TRANSCRIPTASE 1059 I A EIAFAHSAD V I. t S 0 P 

in i i nm nm ii i mi i » ' u ni l m 

Sbje*; $294 tttgagtotffgfttfaaa ettt* ^gtcoc^tg^Uagal:— ttctccct^ctt. 

pol polyprotein 1034 I, E 3 G E T I G p D D R FLPVX 

Query; 5389 -tgti-ctctaggc-tcaaggcgctcg-agtgggtgaggeage-- gagcgtggaagcagc 

REVERSE TRANSCRIPTASE 1076 L F 5 R L K A It E W V R Q * X H K Q 

I II Ml I 11 Mill MM M IM 1 I II Ml I IN 
sMct: 53*3 at-ttactc-a— cttaaaggctctagaaat-tgtaagg-acccaaatc-tgggaccag^ 

pol polypor^teia 1O&0 FT K LKALBIVRTQIWDQ 

hnp:/Avww.ncbi.nIm.nai.gov/blasl^ 21AS2/01 
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0141 221 7739 PAGE. 22 



-23-01 10:50 From: FOLEY, HOAG & ELIOT LLP 
Blast Rcsuli 



2027856687 



T-049 P. 32/53 Job 
P*g« 10 of 16 



E M mi ii i n ii ii I m i ii u ii Mm ii ii 



gattc-agt-ctatgtUgacgccaccs-^ 

m. . . T » . . ■ * • -„ ,«, ;„»,;, , 

S4S2 ga-Uaagtgct-tgtcagacgccatcgacctr — 

1067 D QVL V R R H R P 



, S452 ga-tcaa^ct-tgtcagacaccatogaccca-gciiccfctgagcctcggtggaaaggc 

poljolyprotain " « « * V * R H R P S S I E P R W K S 



SSS6 ccttatct€gtacrtt-ttgaccacaMaacg<(Ctgtgaaagccgaa9|aatccccacctg 

tab, u« » . » . • ,t j » « ,?imm? ,? , r nn 

env protein 1 

_ 561S gatccatgcatcccacgttaagccggcgcc«octcccg-actc-g~gg| tggagag 

wnn mffcixmn mi « * . « y k p a 9^ ^ I ?i ? t t^i mil. 

VlRW ENVfiWPE PROTEIN 1? P X » * * * • * » J A S X X V V ** 

REVERSE TRANSCRIPTASE 1163 A j Y,,T Ml.f.M "l I ? . I II ' ' 1 I " 

sbSet . s$26 ctggaaaagactgateacecE«t»agct--g«gt-- attcggcggc-gg--cgggacga 

Hi pr^ein 25 W K I I I X L I S « V P « « G « * 

pol pSlyprotein 1145 L&KTDHPLKL R IRRR R J> S 

5721 -tct--aacaat---aactcc--c<! ! »ggcc«gtagtaaacgeotta-tagaca|M-c-fl r 

M Www worn* as r. * i * / o a s_a x * l ids s 

f gi^caa-aataagaaccc^acc^ 
protein 43 s&Q N Kwrnv^ « 

pol polyprotein 1X52 5 A s 



S W fB PROTEIN .1 . ? K ( l^L. , n , .* |? , ,» * 

5821 ggtg tcactgtaaa-t-a-gca c~tcgag|- t -g--«g-c«^t-i l g a - 

fiM ct - 5787 ggtgg^cacact-raaacetgatgtatgtgccttggcggctagtcttgagtcctgggat 

env protein 7* W » » t I KPDVCXLAAStESWD 

5860 g4 cacc~t r gt-g g =--ct 8 a B ct 9 «t t tcM^tcc-gatt~g a « a - 

VZRAL EKVBLOFB fnOIIlIJt 41 - t « Tl Til I 1 II 1 Hill II I Mil 

Sb3ct8 « 4 f "cccgggaaccgatgtct^cctctaaac-g^-gt^^ 

env protein 59 ipejgv»a*» 

Query: 5903 ac-cccg C *g«**>*gca— «cct ccca-a- c-«agtc-c|— ta 

VIRAL EJJVEWPE PROTEIN 9S » TllTl III I -I Ml HI " 

sv Hei .. S903 aetLgiia="^"taag=a'aat C acctgg35agccatagggtgc 3 |ctac<^s««ct i » 

ISv protein U8 TAA vkoitwg AI sc3YPRA 



MflR 23 2001 05 '49 



0141 221 7739 PAGE.23 



MAR-23-01 10:50 From: FOLEY, HOAG 4 ELIOT LLP 



2027856687 



T-049 P. 31/53 Job-4S4 
Page n of 16 



Query: 5942 g t--t*tgg — gttcta — it g-ctg-cce— -a-ggc — -ae— -agag 

VIRAL ENVELOPE PROTEIN 108 S Y G F Y CCP G T E 

I I flit I IHt ( 1 1 II Ml I 111 II J J 
Sbjct: 5961 ggaetagaatggcaagctctaccttctacgtatgtccccgggatggccggaccctttcag 
en* protein 13"? RTRMAS5TFYVCPRDGRTLS 

Query: $974 aaag — agaa»atactgtgggggttctggggaat-eet-t-ctgt-aggagat-gga — g 

VIRAL ENVELOPE PROTEIN 119 K EKYCGG5GE5FCRRW S 

Ml MM 1 I I MM! II MM Ml I MM I i J I) Ml 1 
£b jot i £021 -iAgetagaaggtge-g-ggggg--et — agaatccctatacrgraaaga-atgggatt g 

env protein 157 £ A R R C G G I. SSLYCKEWDC 

Query: 602 4 ctgcgtcacctccaacgatggagaatgg-aaatggccgatct — ct-ctc-caggaccgg 

VIRAL ENVELOPE PROTSIN 136 CVTSNDGDVKWP'I S L Q D R 

Ml Ml Mi II Ml II I Ml I MM I Ml I MM 
Sbjct: 6073 -r0*g — acc-- — ar<f— gg-gncr.ga^tat-tgg-r.f.atcfcaaflr.cctc^aagarr'.-- 

env protein 175 £ T T GTGYWLSKSSKD 

Query? 6079 gt-a&&Attctcctttgtc«4ttecggcccgg$c**9ta-caaaatgafcgaaaot~ata't 

VIRAL ENVELOPE PROTEIN 154 V K f S FVNSGPGKY K M W K L Y 

I 1 II I Ml II 1 M M M ft I MM I Mill 

Sbjct: C120 -tcataa — c tgtaaaaL-gggacc aaaatagc-gaatg — g — actca-a- 

ehv protein 190 LI T V K W D Q N £ E W T Q 

Query: 6136 aaagat-aagagctgctcccca-teagaertagat-tate taaagat4-agtttc 

VIRAL ENVELOPE PROTEIN 1*53 KD KSCSP St>I*p Y LKI S F 

Ml I If II II II Ml MM I I M I Mil III I Ml I 
fib jet: 61*61 aaatttcaacag»tg-tcaccagaccggct— ggtgtaacceccttaaa~ataga-*tttc 

env F*9tcin 204 K T Q Q C H Q T G WCHPLK I t> P 

Query: 618** actgaaaggaaaacaggaaaa^attc^a^aagtggataaatggtatgagctggggaat-- 

VIRAL ENVELOPE PROTEIN 190 T E R K T G K YS KVDKWYELGN 

II I II ) I M1IMI 111 III I MMMI II I I II M I M 1 
sbjct : 6215 ac — a — gaeaa-ag^aaaattatccaaggactggataacgggaaaaacctggggaita 
em/ protAtn 222 1 OKGKLSKDWlfGKTWGt 

Query; 6244 agttttttattatatg-g-qggggga-gcagg-gtccactctaac<sattcgccttaggat 

VTKM. £HVQ£CP& PROtEIH 2 09 3 F L £ X Q Q <3 A <J STI»*JRliJ<3 

M III MM I 1 II 1 MM II II I j IMMMI MM I 

Sto jet: 6269 ag^ attccatgtgtctggacatccaggcgtac^-gttcaccattcg-ctta-- a- 

anv protein 240 r f y v s g h pgvq ftir l 

Query r 6300 aga-gacggggaca-gaaccccctgtggcaatgggaeccga — taa*gtactggct-gaa 

VIRAL ENVELOPE PROTEIN 228 t T G T EPPVAKGPD K v L A E 

I I II Ml I II IMIMJI I It II It I I! II I I Ml 
Sbjct : 6318 aaatcac«caacatg«ccagctgtggcagtaggtcctgacct^-cgtccttg-tggaa 

anv protein 256 K X f N M PAVAVGPDL VLVE 



VXR3& ENVELOPE PROTEIN 246 

Sbjct: 
env prottin 



6355 caggggcc-cc eggecctgg— -agceacegeataacttgec gg — tgece 

*"fi ? PAL BPPHNLP VP 

M 1 1 M 1 1 II tIM I Mill i I Ml II M || I 
6371 caaggacctcccagaaegtccctcgdtcr£ccacctcct--cttccc££aagggaagcgc 
274 QGPPRTSLALPP? LPPREA 



Query: 6400 caattaacc-tcgetgcggcctgac-ataa~— cac&gcc gc — eta— 'gca-aeag 

VXRXX ENVELOPE PROTEIN 2£1 9LtSLR?0I TQP P S N S 

M M M II I II IM Ml Ml Ml I M MM Ml I I 
Sbjct: 6429 ca — eegceatctct-c — cccgactctaactccacagccctggisgactagtgcacaaac 

env protein 293 p ppsl pdsnstalatsaqt 

Q\Mty: 6447 taccaetggavigattccta^eaa fiafigcctagaa-ietec^ccaggtgttefitg-t 

VTRAL ENVELOPE PROTEIN 277 TTGLIPTN TPRN$PGVPV 

I MM II MI I III 1 1 Mil M 1 1 1 1 1 II III 

Sbjct: 6484 tcccae-gg — tga-gaaaaacaattgttac-ccta-aacactccgcc — ■ tee — > 

env protein 312 PT u-RKTIVTLNTPP P 



http://vww,ncbi.dm.nfo 



21/02/01 
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0141 221 7739 PAGE. 24 



MAR-23-01 10:50 FronuFOLEY, HOAG & ELIOT LLP 

Blast IU3uh 



2027856687 



T-049 P- 30/53 Job- 
Page 12 of 16 



Query: 

VIKAL ENVELOPE PROTEIN 295 



anv protein 



6S01 taagacaggac-agagac-tctteagtctcatccagggagetttcc^aagccatcaact- 

i inn i t nit t ti i tn i mil fi nil u mini 

6529 caccacagg-cgacagacttctcga-ccttgrtgcagggggccttcctaa»cc-ttaaatg 
G DRLFD I* V Q G A r L t L N 



327 



T T 



Query: 

vim- envelope protein 

Sbjct; 

env protein 



6557 
313 



^caccgaccctgatgteaettcctcttgttggctttgtctat-ectca-ggg-cctcctt 
STDPDATSSCWLCL SS G P P 

i id mi i himi inn liiiinii ii in in 1 1 nit 

6585 ctaccaacccaggggceactgagtcttgctggctttgt-t-tggc-catgggcccccct^ 
345 ATNPGATESCWLC L A M G P P 



Quary: 6614 attat<raoma^-gg«taa^ 

VIRAL ENVELOPE PROTEIN 332 Y Y E G M A K 



sbjee; 

•nv protein 



€642 
364 



agaaagaaaatteaatg — tga-^ecaaagagcataga 
E R K F N V TKEHR 

I III) II t It Ml I II II I II I I I IM Nil! 

attatgaagoaatagectcat;*** ^J^gto— -gcct-actccaccga-ccttg- 

YYEAIASS <3EV A Y S T D L 



Query: 

VlftAL ENVELOPE 2 ROTE IN 350 



Sbjct: 

onv pr^t*in 



6667 aatcaatgtac-atgggggtc^aa—ataagcttaccctcactgaagtcreeggga-a 
N Q C T W G S fc NKLT1TBVSG K 

tin i nun in ii linn minimi in i in t 

6693 -accggtg-ccgctgggggaocc-aagga-aagctcaccctcactgag^tct-caggaca 
aei b ft c pwar qg kltltev sgh 



Query: 

VIRXL ENVELOPE PROTEIN 369 



Sbjot: 

env protein 



6 tOlGXAPP SMQHICY $ T V 

in imiiti mi i m t mi n n n in n u n i 

6748 cgggttgtgcatagg-aaaggtgccctttacccatcagcatccetgcaatcag-accct- 
«00 I I C I C K V PFTHOHLCNQ TL 



Query: 

VTltAl, ENVELOPE P&0?£XN 388 



Sbjct: 

onv protein 



673X gttrargagcaggcctc™ agaaaatcagtatttag-tacctggtrataac aggtg 

V V X Q * C SN0YIVP6YN * 

I IM 1 III! III lllimi Mill I III II It 
6305 ateeatca — attcctttcggagaccatcagtatct-gctt5cc™ctccaaccatagctcf 

31 H33«OHQYLtP SMHSW 



Query j 

VIRAL ENVELOPE 

Sbjct: 

env protein 



6834 gtgggcatgcaatactgggttaacccectgtgtttccacctcasftcttcwcca-atcca 
PROTEIN 406 WACNTGLTPCVSTSV FNO S 
?H 5 HIMI Itll MM! I MM! M M I III I 1 1 1 1 III III M I I 

6365 gcgggcttgcagcactggcctcaccccttgcctetccacctcagtttttaatcaga-ctd 
437 W ACSTGLTPCLSTSVFNQ T 



Query: 

VIRAL ENVELOPE PKOTSIK 
env protein 



6S93 aagatttctgrgc-cat^gcccaaatcg-tcccccg-agtgtactaccatcctgaggaag 

425KDTCV MVQI V ? K V * * « B * 

tiiuiMM i ii mm i i i ii ii i i ii m iiiiin mi 

691B ^qatttotgcatiica-ggtccagct-gattcctcgca-tctattactatcctgaagaag 
4S6 K D t C I Q VOL I t R I Y Y Y F E E 



Ouery: «B50 tggtccttgarga-a ratgactatcggtataacc — 4 gaccaaaaagac aacccg 

VX1U ENVELOPE PROTEIN 444 V V L ^ B ( ^ D ( * * X f W ( l^p^Tf^ f , 

6975 t-~t— ttg-ttacaggcctatgacaat — tctca^cccagga-ctaaaagagaggctg 
V LLQAYDN SHPRtKRBA 



Sbjct: 
env protein 



475 



Query; 
VIRAL 



anv protein 



7001 tatcccttaccctagctgtaatgetcggattaggg acggccgttggcgtaggaaca 

ENVELOPE PROTEIN 461 V S L T L A V M L fi L G T A V fi V 6 T 

in mimtttiiti t ii n n in mi n n nn ii 

702 6 tcticacttaccctagctgttttactggggtt-g^gaaT:e*£igg-ca— ggaataggtacfc 
492 VSLTLAVLLOL GIT A CIGT 



Quary; 7057 gggacagctgccctgatcacaggacc-acagcagctagagaaaggacttggtgag-ct— 

VIRAL KVOO* WOll 480 )? » * |? T^J ( 0 Q ( L * ( K (| G ^ . . 1 

sbjct: 7C 
env protein 511 



7082 ggttcaactgccttaattaaaggacctatag-acctccagcaagg-cctgacaagcctcc 
511 GSTA1IKGPI D L Q. 0 G L T 3 L 



Qu^y: 7113 ncatgcgqocat-gPoagaagaicrccgag-oattaaa^Bgt^tat^eA^fit^A* 

http:/Avww.ncb(.nIm.mTi,SQv^Iaat/b|2seq^ 21/02/01 



MAR 23 2201 05149 



0X41 221 7739 PfiGE.25 



MAR-23-QI 10:49 From: FOLEY, HOAG & ELIOT LLP 
Blast Result 



2027856687 



T-048- P.28/53 --Jcb-484 
yzQt is at 16 



VIRAL EKVUOP* PROTEIN <9. K * * « (( T ^ D t () R ( A I X Z B V ^ K ^ 

-J 171 eavtccctoacttctttfftctgaagtggcrctacagaaccggag-aggac wga-tctsrc 

M « — » ». . . |fllTllf t„? 1? « ,» lf ,? ,? }„f , ,J , t,, 

... t . 1x96 gactcaetgacttccetgtcegaggtagtgctccaaaataggagatfgcoct-gact-tgc 

S%t*lo fi « " D « L » 8 t * * V V I Q » * » « 1 0 I 



VIWO- SW^Ct-OSE ? ROTE IK 



7229 t^tttctaagaga^tgogttatgcgcAgccttaaaagaagaat^^cttctatgtag 

iiiiiiiii iiiiinii J inn in mi mil H- «< i« " m 

sb . cr . 72S4 tfltttctaaaagaag^tggGctctgcgcggccctaaaggaagaattf ctgtttttacata^ 

7289 atcactcaggagcca-t-cagagaotccatgaacaagct— tagaaaaa— agttagaga 
VIRAL " ENVELOPE »oni> «7 f H 3 . X I f D ■ * ■ « (f , 

, . W4 ieiiUiiggtg-cajtac-gggaotccatgaaaaaactPMagaaaaactggata-aaa 

anv protein S88DHSS A V ROSMKKXJCEKtD K 



Query: 7343 ggc-g-tcgaagggaaagagaggctgacMggggtggcttgaaggatggttcaacaggtc 

. 73n gaeagttag-agcgwsagaaaag ccaaaactggtatgaaggatggttcaataactc 

Sv protein 607 R 0 L E * Q K $ QNWVEGffFNNS 

0ll6rv , 7401 teotwatgaccaocctgctttctgc--tctgacggggcccctagtagtectgcjccfcs 

™ZWo M »« j . . . i |? t » Miii'im ii mmJiji 

sb H cr . 7426 ccctt^gttcactaccctgctatcaac^atc-g-ctgggcccctattacteotccjtctg 

ell protein 625 SWFTTLLSTI AGPLLLLLI* 

ouerv . 7459 ttact tacagt- - t-gggect tgc ttaattaataggtttgttgc-ctttgt taga-ga-a 

SiS'bwbow >» « l » , » ( . . ( = f |f ,» » f » x f ; » |f , 

sh j ct: 7484 ttffct--ca-ceeMgggccatgcatcatcaataagttagtt-caattcatca-atg«t:a 

env protein W I I IX.GPCIIKKI.V G F I ND 

0uery . 7613 asgagtgagtgca-gtccaga-tcatggtacttaggeaacagtaccaaggeettej-gagc 

VZRAL ENVELOPE PROTEIN 632 R V S A V 0 I M V L R Q Q V Q a I T. S 

J | I I 1 1 I M I I I I H I I I I 1 1 1 1 I J I l I II I Mill 1 1 I H 
sbjet- 7S3S gga-raagtgcai-gttaaaattc-tggtccttagacaa-aatarc-aggcc— ct*£ag- 

env prct*ia 663 F I S A C ^ A 

0uary - 7S70 caaggagaaactgacctctagccttcccagttctaagattagaactatt^acaagacaag 

VIRAL* ENVBWPB PROTEIN Q « » « D ft ~ |( ,„„„„„„,„ 

3b j ct . 7592 -aa— egaaggtaacottt^ttbt get£taagatt»g*getati:cacaagagaA- 

Query: V 630 aagtggggaatgaaagga-ug--aaaacgcaaectaaccctc^(^ 

y I I 111111111111 ill I I II II II III III II II 

Sbjcts 7644 atg-ggggaatgaaagaagtgtttttttttagcc-aa-- ct-gcagtaacgcc— attt 

Qu4 7685 taata--*a**geT:ctaaatacccdeg--aat^^ 

9 V I II 1 I I 111 II 1 1 II II III II 1 in m I 

sb j ct: 7696 tgctaggcaca-c^ta^a-ggataggaaa^atacag — ctaagaaeagg— gec— a 

nuerv , 7737 aatagg-ta— gaaggtca— cacttwtattgttccagggcctgctatcctggcctaa 
UU y ' II III II I II IM 111 J M M II II II I Ml I I 
Sbjct: 7745 aacaggatatctg-tggtcatgeac— c tgggee eeggc— ceaggee-aa 

Guary; 77fH gta-aga taacaggaaacgagttgact-aatc-gcttatctggaut-ctgraa-aa 

wry || ||| i l| ll IM It 111 f tl I II I I II I I II I 

http://wvw.ncbi.nlmJuh.Eov/bl«tAn 21/02/01 
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-23-01 10:49 From: FOLEY, HOAG & ELIOT LLP 



2027856687 



T-049 P. 28/53 Job- 
faflc HOI io 



Sbjct; 



11 90 w «flagggttccca-gaMt-.aflatqagtc4a-caijc--a ^tccaaew* a 



79u UUJttUcUJt^ 



sbjct: 



mc 4tiili4— t-l-iJi Mgi -«<«-«-->t~-*>"-=W9" 



Sbjct: 



79*4 "ttaaaat g attg gt cc^ 

7S21 Hl-l lie ccitggc-cU-iUttgaactaa ccaa* 



Sbjct: 



80!3 gg f cgc-gggctc^atat^^^ 
Sb5ctt 7 »4 -il-ittleilieiW t ct 9t .ac-c- C - g cg«U-tt s c 

fl070 ^cccat.a-^ 

79S>0 Li t i i_ii t ii;Ii.^^ g ^o t c- C »c C cog4ffcac-c*gt C ct-t a 



3033 UaiacU-i^Ucici-iJtac;c- g t,t g -- t cc a at a aa a --cc t ctt 8 ct g att 



Sbjct 



8183 gca 8185 
111 



Sbjct: 8086 9 ca 8088 

scor* - 113 bits (1449). £*pect = 9 «;22 
Jd°rUi*i*- = <484>, Gaps - 460/1182 (38%) 

Strand = Plus / Plus 



7«Sc«--WtK«,^«0M«-^--««--«^'9y'5«--J----«^-- 

«-»._ as* -=«- T - T -»--?'rT t T--=T-7 a i ;;f""f 

ill i i in in t i hi ii i 

ttgcatccggaqccgtggtctcgttgttccttgggagggtttctcctaactattg 



VIWUi EKVEWJPB PROTBJtf «13 I> 
Sbjct; 60 « ct * at 

Wl -g^-tt.-tgt-t Br -tttg t-t-aga-.-gaa-cgagtga 

3bjct! ua accicoca^icggggiU-caiat^ggggctcgtc^ggatcggaaaccccaccca 



aag-g 



Query gtg--ca--g-tcca-g a t C --^tggta--cttaggcaacag-ta«-a; ^ 

eifl y. 1L S O 6 E T D L 



Query: '"^ i r, r v t u j. 

VIRAL EHVELOfE PROTEIN 648 I 1 S Q 6 B | , , , ,, , I 1 1 1 1 1 I 

1«0S rrr - rrrT ac rrnr -- r a nt : r -^ (r -^ 

http://w^4ic&rim.wh.gov/^^ 
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Blast Result 



2027856887 



T-049 P.2F/53 Job-494 
Pflgfc 15 of 16 



Query: 
shjcct 

Query: 
Sbjcti 

Query: 

Query: 
sbjct! 

Query i 
Sbjct: 

gas poly^rotein 



Query: 
Sbjcx: 

gag polypratoin 



Query: 

gag polyprotein 



Quary; 
SbjCt: 

gag polypr<=>t*in 



Query: 
Sbjct: 

gag polypt^tein 



Query; 
Sbjct: 

gag polyprOteiA 



Quary: 
sbjett 

gag polyprotein 



Quary: 
Shjct: 

gag poiyprotain 



Query! 
sbjcti 

gag polyptotain 



Query: 



7635 

II III 



-tgaaag-ga 
II II II 



— tgaaaat- 
fl I I 



gc-aaccta- 
II 111 I 



-ac c— 

II 1 



-etc- 
HI 



3S2 ggrgiatcactctgcgaatgacgtgtgagtagcgaac-agacgtgttcgg^gctcaccig 



7 669 ccag--aaccca-gga'*a-gt — ta~-*t-a- 



-a--a — a- 



410 



-a-gctctaaatg— c 

nT'TrTu TIT T n 1 n T i ■ " » '« n hi 

Utggtaatccagggagacg^ccca^gatcaggggaggaccagggacgc-ct-g?rtggac 



7705 



4€8 



77 1 if II 1 I I MM I I I I Ml II I III HI U I 
iUUggt;Icgggtcartgtgaccc-<iattxcatc5COcgt--ct-ggtaagacgcgct 



7iai M cacwct-«^t"tc.c-*g^-4«t -gar*Tx:--ctgg e-efcaag-taa 

7131 ,,t il hi I 11 | j ( MM M 1 1 MM I 1 I I 



n*S gataacaggaaa tgagt-tgactaatc— gct-r- — ate-tggat^ 



1 Mill i M I III M Ml 



562 
1 



i-ttti^gaaagcetc^ 



I 1 1 1 I Mill Ml I II I I Ml M Ml MM 
640 garaattctacccct-auctccctcactctaaatcaetggagagBrgtgagaaca^aggg 
4 PHSTP ISIfTLKHWRDVRT R 



-zczq — cc-c— 
1 1 1 1 M I 



7874 ttgaca^ccota«^gt-gacctatctcaa6!:gcaatctgtc--a^ 

lilt Mi MM I M M III M > M 
£3 AH HLSVE IKK 



GKWQTFCSS 



7921 



ag-gageccae geagatgeg— gacctccggag cz - - 

II I HUM M 1 M I I M 1 II I II M * 
7S5 agtg4ccoacattcggc-g-tggggtggccaccggagggaacttttaatctctctgtc^ 

42 £ W PTfrG V G « P P B ' 



6 T P V Ii S 



79*4 tx --ttaaaatgattggtc-—c»cgw*-c^ 

II mil! Mil 111 M MM M Ml I M 1 M III 

812 tttttgcttgttaaawgatt^wtt^^ 

fil I F A V K K I VPQ EKGGH P D Q V 

7397 *aaatgat--t$gtccatgg-a^ 

I | | II I 1 1 1 M II 1 1 II It I I M M 1 HI tit 
8*5 C atatitcgLgg t --atggcaggac-ctcgc-c-caga-atcccccac^ t g-«gg^ 



8048 -trg— tgacg-csPAg— gcc-ttgt-tg«^*-aoo- 

I I II M II HI HI) ! II! HI 



— oca-t- 

Ml ) 



919 ccagcct-ccgeca~ a ggt*g 5 tg_^ 



$7 



p A 3 A 



KVAVV5DTKRP 



8Q81 -*-aagct-gt-cccgattccgca-ct---c^ 

i | | * I i j | | I l 111 M I M I 1 1 1 1 i H 



977 catc«gQtcet*e«cga 



116PSAPP&P* 



XYPATDDliIiLI.5 



Biifi -aece-cta eg — t gg-tgtac-g— a etgtgggccc-cagcgcgct 

MM I I Ml 1 1 I M I I II HI HI HI HI * 
1037 aiUU4c«c^ 

135 E B ? PPPYPAALPPPLAPQ A i 



8158 tgga- acaa aaat-cctcttgct-gtttgc-atc 8187 

||| || I M M I l I I I M Ml 
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— Page IS of 16 

Blast Rt$uU 

gag palyprotein lS6 * r 

is u uaar sees. oa» »V«. 

flapped 

Lambda * H „ A4 , c 

0,0514 0.0120 0*0135 

Gapped 

Lambda K n _ A ,^ c 
0?0$14 0.6120 0.0X35 

^Penalties: &d«encc: 1, Extension: 2 

Number of Sequences o 

Number of extensions: 79b4 

Sumber of successful extensions: 13 

dumber ot sequences better than 10. Os 1 

l*ng*th of quo*yi 9209 

imgth of ^tabase: 2,6ia^3^S56 

effective HSP lengths 1920 

cffaotiv* length *f *a*ryt 6289 

effective i^n^h of *^*> 'ig;*;;*'"* 

affective search space; l7 5 d0 f;lS*?I21. a7D - 
elective search =p*c* uae4* n«oSS2843704 

f: 0 
A: 0 

xl: 135 (10.0 bits) 
X2; 1347 (99.9 bits) 
31? 250 (24,9 bits) 
S2: 463 (40-7 bits) 



hrtp://ivwwj*^ 
MAR 23 2901 05:51 
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tuasiitesuit 




Blast 2 Sequences results 

■ PubMed Entrez BLAST OMIM Taxonomy 

BLAST 2 SEQUENCES RESULTS VERSION BIASTN 2<U [Nov-13-2000] 



Structure 



Match: I . J Mismatch: 



gap open: IL 1 gap extension: iL.i 



Sequence t p 4538106 Sequence 3 from Patent WO9740167. Leng^te 1575 (588 2162) 
Sequence 2 gi 9^30311 Gibbon leukemia vims, complete genome Length 9.370 (631 30QG) 




NOTE: The statistics (bitscore and expect value) is calculated based on the size of nr database 

NOTE'Jf protein translation is reversed, please repeat the search with reverse strand of the query 
sequence 

Score » 362 bits (4802) , Expect = 2e-98 
Identities =* 1164/1149 (66%), Gaps = 345/1743 (19%) 
Strand ■ Plus / Plus 



Query: 588 
VIRAL CORE PROTEIN 1 

Sbjct: 631 
gag polyprotein 1 



atgggac-agacs^tga.ctaccccccttagn-t-^'tgactctcgRCcattggact 
HG QTVTTPLS L T L D H ? 

limn in i i mini nil i inn i u 1 1 1 1 

atgggacaagataat-tctacccc — -ta-tctccctcactctaaatcactgga< — 
KGQDN S T P ISLTLNHW 



Que*yi 6^2 
VIRAL CORE PROTEIN 10 

Sbjct: 684 
gag polyprotein IB 



-gttogatccAgggctcataatttgtcagtccagg — -ttaagaagggaccttgg'c 
VRSRAHNLSVQ VKKGPW 
II III I I J I 1 1 1 1 J III I II II It I I! IINit 1111 

tgtgagaacaagggctcacaatctatccgt — ggaaatcaaaaagggaaaatgqrc 
VRTRAHNLSV EIKKGKW 



Query t 698 ttt^tgtgoctetgaatggccaacftttegac^ttggatg^ccatca^ggggacct 

VIRAL CORE PROTEIN 38 FCASEWPTFDVGWFSEGT 

1 1 1 1 1 1 1 nil ii tun 1 1 1 N 1 1 n ii mini i inn ii i 

Sbjct; 741 tttctgttcctccgagtggcccacattcggcgtggggtggccaccggagggaactt 

gag polyprotein 38 FCSSEWPTFGVGWPFEGT 



Query: 758 t — tetgaaattatcctggctgttaaggcaatcatt — tttca-gactggaccc 

VIRAL CORE PROTEIN 57 S'EItLAVK'AII FQTGP 

I UN 111 I I II Mill II III Mill II II I 
abjet; 001 tctctetg^catt-t — ttgcagttaa aaagattgtctttcagga ga-ac 

http://\vww.ncbi.nlm.w^ 22/02/01 



MPR 23 2001 05:52 



0141 221 7739 PAGE. 30 



UAR-23-01 10:48 From: FOLEY, HOAG & ELIOT LLP 
Blast Result 



2027856687 



T-049 P- 24/53 Job-494 
Page 2 of 5 



^ag polyprotein 57 L S 



F A V K 



V f Q E 



N 



Query; 

VIRAL CORE 



811 
PROTEIN 75 



Sbjct: 

gag polyprotein 



Quesy; 

VIRAL CORE PROTEIN 
Sbjct: 

gag polyprotein 



851 
74 



8 65 
93 

90S 
92 



ctcatcctgatcaggag-- cectatatcet-- tacgtggcaagatttgg— cagaa 
SHPDQE P Y I L TWQDL A E 
MIM II M li II IIIMI I II HIM M I t I I II 
gacatccggacca— agttccatatatcgtggta—tggcaggacctcgcccaga- 

G H P P Q VPYIVV 



W Q 0 L A Q 



ctccgccatgggtttta&ccatggct&aataaecoaagaaagce-&ggtccGcg&at 
P P P W V KPWLNKPRKP G P R I 



1 II Itllllll IN I 11 II 

ecccaccatgggt — gcca-gcct cc- 

pppwv pa 5 



111 IMM 1 

-™ g ecaaggtcgctg — t 

A K V A V 



Query: 324 

VIRAL COBB PROTEIN 113 

Sbjct; 944 

gag polyprotein 105 



gctcttggaga-— gaaaaacaaacact— c— — ggccgaaaaagtcgagc-cc 
A L G E KHKH S A E K V E P 

| I 1 1 II 1 I II M It I I I I 1 1 I II HI It 

tctct— gatacccg-aagae — cagttgcggggaggcc-a— — te-agctcc 
VS £> T R R PVAGRP SAP 



Query: 973 cctcg 

VIRAL CORE PROTEIN 129 S S 



Sbjct: 

gag polyprotein 



tatctaccccgag-atc^gaggage — cgccgact-tggccgga 
Y L P R D R G A AD LAG 
M M Mill till I I I IMI I I II II 1 I II 

989 cc-cgaccccccatcta-cccg-gcaacagacga-cttactcctcctct-— ctga 
120 P R P P I Y P ATDD LLLL S E 



Query: 

VIRAL CORE PROTEIN 145 
sbjct: 

gag polyprotein 



1021 caacctgttcccccacccccttatccagcacagggtgctgtg&ggggacctct-gc 
PTCSFTPLSSXGCCEGTS A 

II I UMI 111 Mill II II III I HIM ti 

1043 — &o— g— cccoogctoe tatccggc ggeactgeca — cccafitctggc 

138 T PPP Y P A ALP PPLA 



Query: 1080 c 

VIRAL COI^fi PROTEIN 165 



-e~t-gga— go— tccgg tg — g-t-ggaggga-cctg — - — c 

y GAP V VUGP A 

I l | III II III! II Ml II II IIH I 
Sbict • 1037 caaacgatcggaccaccgtcaacccagatgcccgatagtagcgatcctgagoggco 

gag polyprotein 153 QAIGPPSGQMP~~ 



DSSDPEGP 



Query; 1113 
VIRAL CORE PROTEIN 176 

Sbjct: U47 
gag polyprotein 173 



gggactcggagccggagaggcgcc-accccggag-- cggacagacgaga-tcgcga 
GTRSRRGA TPE R T D E I A 

i ii in ii i in mi mi ii i i urn i mm i 

. , c tggga~ccag~gagtcgccgtgcccgcagtecag-cagac-a-actcg-gg 
A G TR SRRARSP A P N $ G 



Query: 1159 acc-g-ctgcgcacct — a — tggccctc ccat-g ccagg— g-gg 

VIRAL COWS PROTEIN ,*„*.! I IUMII IMI. N II I I 

Sbjctt 1195 -cctgact-c-ca-ctgtgattttgcccctccgagccataggftcccccggccgagc 

gag polyprotein 189 PDS TVIL 



PLRAIGPPAE 



Query: 
VIRAL CORE 



PROTEIN 208 



Sbjct i 

gag 1 polyprotoin 



1209 ttgc—agcccetc— oagtattggeccttttcttetgeagatctetataattgga 
L Q P L QYWPPSSAPLYNW 
, II I Mill II IIIIIIM Mill M MINIM M1MMIM 
1251 tggcctggtccctctacaatatcggcctttttcotcagcagatctttataattgga 

206 GLVPLQYWPFSS-~" V "" 



P L Y N W 



taaccatccccctttctcggaggatcccca— acgcctcacgggg-t—tggtgga 
HHPPFSEDPQ RLTG LVE 
1 1 J I II I M I 1 11 II If I MM I I IMM MM 1 I I II 
httptf*^^ 22/02/01 



Querys 
VIRAL CORE 



1265 
PROTEIN 227 



014 i 221 ??39 PAGS.31 

MPtR 23 2001 05 152 



23-01 10:47 

Blast Result 



9 P. -2 3/63* Jcb- 




Blast 2 Sequences results 

■SSft- BLAST 0«,M — 

BLAST 2 SMOTNOS MSCLTS VERSION BIASTN IU (Nov-,3.2000) 



Sequence 1 giiSlM Sequence 3 from Patent gQgg go88) 
S^ aaa ce 2 tf Sf^LL Gibboa leutenua vuus, comple* genome Un^ 



8 . 

sequence 
Strand = Plus / Plus 



REVERSE TOWSCRIPTASE 
Sbjct: 

pol polyprotem 
Query: 

Averse ti*ssciuptas£ 
Sbjct: 

pol polyprotein 
Query : 

REVELS* TPJOISCRIPTASE 

Sbjct; 

pol polyproteln 



2194 9«Wtcaooattc<j<jaccccctccccgaac T L T v 

2222 a C c a *tt*a 9 tt CC t n t g ^ 



228 0 ggaa*acta~---a f «^ 
40 6 K L K B K K S W 

HUH II . ilL aa l c— crgtcgtgga-aggagcgacaggc 

2311 ggaaaagtagggtccagaegga c sra-jog G * r g 

2331 -ag«tccatg g acc a cc Tar ccgtt---gacttgg r - g t (J ggacg 

poX polyprotcin 



rtAR 23 2031 E5i53 



0141 221 ?739 



PAGE . 32 



MAR-23-01 10:47 From: FOLEY, HOAG i ELIOT LLP 
Blast R**dt 



2027856687 



T-049 P. 22/53 Jcb- 
rage ^ w o 



Query; _ 
REVERSE TMNSCWFTMB 

sbjct* 

pol polyprctexn 



Query: 

AVERSE TRANSCRIPTASE 
Sbjetl: 

pol polypcoteia 



SvERSE TRANSCRIPTASE 

Sbjct: , 
pol polYP*otein 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 

pol polypr^tein 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct: 



Query * 

REVERSE TRANSCRIPTASE 
Sbjct; 

pol palypxotein 



2443 ccaaqa- 



$4 



G X Q 1 S 
II II I II 



F 

mi 



i i i 



Q G 

II I II I 



94 



T K t K 



A Q 



249! c-tgtgaataacaaacccatcactgtgttgacc- 



* n w i T iTi " in i 11 11 



-ctcc-aa— ttagatga 
1 Q LDP 
III II I II II 



lil 11 " 1 J.U_r-aetat0-ta-cctggtcctaaacct g gaaga 



HO T 

2545 g actatattctccccaagtaaa^ 

128 * L Y S P Q V K P D | | n, , || III 
2i«ij^4^ 

128 R L 



146 



" I' 1 1 1 1 1 1 1 1 ' ' ' ' iiiiUi-'iiccgiciiiiiacuiicii 
2625 gc-ttttccccactgUtgggeagaaa g age gg ^gw l ^ ^ 

145 ^FPTvwA* 

2653 cc^acag-gttatt T a-ct g aaggcca-|tg-ct a c a .c^-«- 



2707 acocc,tgagtagagaggctcg a g,aggaatttggc|gcat g t t ca> aa gat 



181 Y P M 



Sbjct: 

pol polyprotein 

g£»SE TRANSCRIPTASE 220 R K P G I » * * , , , , | I , , 1 1 I HI IN I 
sb . t . 28 47 alaaaag^U^^^^ 



Quary: 

REVERSE TRANSCRIPTASE 
Sb j ct : 

pol polyprotein 



Query: 

REVERSE TRANSCRIPTASE 
Sbjct 



287* ggtscaggacataca^ccaacggtcccga^ 



239 



V Q I) I K 



29 37 ectgaaegg^^ 

259 P E R N « Y i 1 1 1 1 1 1 1 1 Mil 

111 1 1 titUtliUiitUgtcUaUtc^caaggatgc^tttct 



2»65 cct-age 



http: 



; ; )w.ncbi. n ln».niU.gov/blast^Cs e q/^la S t2.ogi?0 



22/02/01 



NRR 23 2001 <3S : 54 



0141 221 7739 FftG£.33 



23-01 10:46 From: FOLEY. HOAG T-049 P 21/53 Job-484 

CJNMM B,ast 2 Sec 1 U6 nces results 

^PybMed^^^Enfez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION BLASTN 2.1.2 JNoy-WOOOJ 

Match: Mismatch: El gap open: gap extension: 
dropoff: E°? expect: ft?. : ?J wordsize: E gjtel r 



Sequence I gi_4S38l06 Sequence 3 from Patent W0974Q167. U&gtb 1971 (5620 .. 7590) 
Sequence 2 gi 26303.U Gibbon leukemia vims, complete genome Length 2449 (5552 ".. 8000) 



NOTE;The statistics (bitscore and expect value) is calculated based on the size of nr database 

NOTE:If protein translation is reversed, please repeat the search with reverse strand of the queiy 
sequence 

Score ^ 410 bits (5448>, E*pect = e-U2 
Identities = 1396/2290 (60%), Gaps = 560/2290 (24%i 
Strand - Plus / Plus 



Qy ery . 5620 atgcatcccacgttaagccggcgccacctcccg-actc-g-- tfgsf— tgga 

VIRAL E&TOLOPE PROTEIN 1 KHPTLSRRHLPTR G G 
REVERSE TRANSCRIPTASE 1153 HASHVKPAPPF PS G W 

nil ii Mi i ii ii il mill i i m i i ii mi 

cw-t-r. 5573 atgcttcteacctcaaacctgcaccaccfctccgcac-cagatgagtcctgg- 

env protein 8 MLLTSNLHHLFH Q K S P <S 

Si polSrotein U27 HASHLKPAPPSA pdesw 

Query* 5672 aaagactgagaatccccttaagettcgcctcca-tc-gc-- ctggttcctta 

VIRAX,'e*VELOPE PROTEIN 18 KRLRIPLSFA S I A W F L 
REVERSE TRANSCRIPTASE 1111 KTEWPLKLR LH R L V P Y 

] 1 1 1 1 1 1 1 i INI III IM I 1 1 1 1 Ml I I 

sbict • 5631 aaagactgatcatcctcttaagct— gcgt— atteggeggc-gg— eggga 

env protein 27KRLIILLS cv fggg g 

^1 poly^rot^in 1147 KTPHPLKL ft IRRR RD 

Query: 5724 -aacaat — aaetcc— ccag g-c— -ea gta--gt— 

VIRAL EHVELOPE PROTEIN 36 Tl TPQ A S AA ^ 

REVERSE TRANSCRIPTASE 1188 N H N S P G Q 

II III III 1 1 Mil I 1 M Ml M 

til Jwein 45 QMKKPHQPKTLTHQVJ.S 

pol polyprotein 1164 a K 



A A ^ 



Qvaxy. <wsfct — at- ^^cAgcrtcg^acccccac^cr^cctt— t*to 

VIRAL ENVELOPE PROTEIN 46 L I DSSNPHRP LS 

] 1 1 M 1 1 1 1 1 1 1 I I M M I 1 1 1 11 1 I 

Shict« 5^43 agacgttgtctgggatacaaaggeag-tccagcccccttgga-cttggtggc 

env protein 65 DVVvJDTKAVQPPWTWW 

http:/Avvw.ncbi^ 22/02/01 



MRR 23 2001 05=54 



0141 221 T?39 PAGE. 34 



MAR-23-01 10:46 From: FOLEY, HOAG & ELIOT LLP 2027856687 T-049 P. 20/53 Job-494 

BtotResult P*8 e 2 of 5 



Query: 5794 — acctggctgattat-tgaccctgatacgggt-gtcactgtaaatagcact 

VIRAL ENVELOPE PROTEIN 59 T W L I I . P D T G VTVNST 

1 1 1 I I II 111 II I I II Ml i 11 I I t 1 I Ml 
Sbjct : 5801 aaacct-gatg-- tatgtg-ccttg--gcggctagt--et-tgagt--c^ct 

cftv protein «4 K P D V C A L A A S L E 6 

Query; $fi49 ttgctcctagaggcacc— tggt — ggc — ctgaaetgcatttctg-cctc 

VIRAL ENVELOPE PROTEIN 77 V A P R G T « W PELHFCL 

I III I If Ml III I I N ill I I II I MM 
Sbjct : SS47 t — c-ce--g-ggaaccgat-gtctcgtcctctaaac-g-a-gteagacc tie 
*nv protein 99 I P GTDVSSSK R VRJ? 

Query; 5893 gatta-ac-eccgctgttaaaagca— - cacct— — —ccca-a c— 

VIRAL ENVELOPS PROTPXH 94 IMPAVKfi « P P N 

II II II 1 1 1 1 IN Mill IMM ||( | j 
Sbjct : 5897 gactatactgccgc — ttataagcaaatcacctggggagccatagggtgcag 
env protein 1X6 D Y T A A YKQITWGAIGCS 

Query: 5933 g"-tag — 't—tatgg gttcta — ttget — g ccc — a-g 

VIRAL ENVELOPE PROTEIN 107 R S Y G F Y C C P 

I Ml I Mil I MM t | M I 111 I I 

Sbjct: 5955 gggctaggactagaatggcaagctctacctt-ctacgtatgtccccgggatg 

env protein 135 RARTRMASSTF YVCPRD 

Qua*y: $969 cagagaaAg—agaa-atactgtgggggttctggggaAt-ect-t-at 

VIRAL ENVELOPE PRQTEIK 117 T E K EK YCGGSGE S F 

III III MM III M I J I li I I II III I 1 i 
Sbjct : 6014 ctttcag — aagctagaaggtgc-g-ggggg — ct — agaatccctatact 

env protein 15SLS EARRC GG L ESLY 



Query; 6018 at-gga — gctgcgtcacctccaacgatggagactgg-aaatggccgatc— 

VIRAL ENVELOPE PROTEItt 134 W SCVTSHDGDWKWPI 

II III I 1 1 I MI III II Ml II I III t III 

Sbjct; 6064 otgggattg^tgag-^acc acg — gg-gaccggttattgg-ctatcta 

env protein 172 WDCE T T G T G Y W , L S 



Query; 6070 -caggaccgggt-aaaattctcctttgtcaattccggcccgggcaagtacaa 

VIRAL ENVELOPE PXDTEIN 151 QPRV KFSFVNSGPGKYK 

Mill i I II I Ml II I till Ml 

Sbjct s 6112 aaaagacc — tcataa-- c tgtaaaat — — —ggg-a — ccaa 

env protein 187 k d lit vk wd.q 



Query: 6124 atgaaact-atataaagat — -aagagctgctcccca-tcagacttagat-t 

VIRAL ENVELOPE PROTEIN 169 MKL YKD JC 3 C 3 P 3 D L D 

IM III I Ml II II II 1 1 M Ml III! Ill 
Sbjct; 6151 atg-gactca—aaa-atttcaacag-tg-tcaccagaccggct — ggtgt 

env protein 201 WTQ KFQQCHQTG W c 



Query t 6173 taaagata-agtttcactgaaaggaa^Aeaggaaaa-tatteaa-aagtgga 

VIRAL ENVELOPE PROTEIN 185 L K I SFTERKTGJC Y S KVD 

MM Ml I MUM 1 M II MMIM Ml III I II I I 

Sbjct: 6202 taaa-ataga^tttcac— a gacaa-aggaaaattatccaaggactgga 

env protein 218 KID FT DKGKLSKDW 



Query: 6230 atgagctggggaat-agttttttattatatg-g-cggggga-gcagg-gtcc 

VIRAL ENVELOPE PROTEIN 204 Y E L G N SFLLY G GG AG 5 

1 1 M Mill I II MS MM I i M ) MM 11 I 

Sbjct: 6254 aaaacctggggattaag -attctatgtgtctggacatccaggcgtae 

env protein 235 KTWGLR FYV5GHPGV 

htfp://www.i^^ 22/02/0 1 



MAR 23 2001 05:55 



0141 221 7739 



